Teaching authentic data science without prerequisites

Matthew Beckman
Penn State University

Daniel Kaplan
Macalester College

U.S. Conference on Teaching Statistics
University Park, PA
May 20, 2017

Background: Introduction to R (Penn State)

Structure

Students

Thoughts before, during, and after course

Background: Data Computing (Macalester)

Tools: Working code

Tools: RMarkdown

Tools: Other Resources

Sample Activities

https://mdbeckman.github.io/USCOTS2017Breakout/

Student Outcomes: Core Skills

Student Outcomes: Broad Exposure

PSU Final Project

FIFA World Rankings Analysis

Movies in the 21st Century

Stanley Cup Winners

2013 FBI Crime Reporting

Vegetarian Restaurant Analysis

# Student Code
data1<-unique(data) 
data2<-na.omit(data1)

RestaurantMap <-
 leaflet(data2) %>%
 addTiles() %>%
 addCircleMarkers(radius = 2, color = "red") %>%
 setView( lng =-73.935242, lat =40.730610, zoom = 12)  #New York
## Assuming 'longitude' and 'latitude' are longitude and latitude, respectively

Vegetarian Restaurant Analysis (Static)

Other interesting projects

Leading Causes of Death in NYC

Analysis of Thanksgiving

MLB Free Agent Analysis

History of Reddit

Student Feedback

Course Details

Introduction to R (Penn State)